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(54) A method for de novo peptide sequence determination 



(57) A method for determining the amino acid se- 
quence of an unknown peptide comprising 

(a) determining a molecular mass and an experi- 
mental fragmentation spectrum for the unknown 
peptide; 

(b) comparing the experimental fragmentation 
spectrum of the unknown peptide to theoretical 



fragmentation spectra calculated for a peptide li- 
brary composed of all possible linear sequences of 
amino acids having a total mass that corresponds 
to the molecular mass of the unknown peptide; and 
(c) identifying a peptide in the peptide library having 
a theoretical fragmentation spectrum that matches 
most closely the fragmentation spectrum of the un- 
known peptide. 
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Description 

FIELD OF THE INVENTION 

5 The invention relates to a method for the determination of the precise linear sequence of amino acids in a peptide, 

polypeptide or protein, without recourse or reference to either a known pre-defined data base or to sequential amino 
acid residue analysis. As such, the method of the invention is a true, de novo peptide sequence determination method. 

BACKGROUND OF THE INVENTION 

10 

The composition of a peptide (which term includes also polypeptide or protein) as a sequence of amino acids is 
well understood. Each peptide is uniquely defined by a precise linear sequence of amino acids Knowledge of the 
precise linear arrangement or sequence of amino acids in a peptide is required for various purposes, including DNA 
cloning in which the sequence of amino acids provides information required for oligonucleotide probes and polymerase 

15 chain reaction ("PGR') primers. Knowledge of the exact sequence also allows the synthesis of peptides for antibody 
production, provides identification of peptides, aids in the characterization of recombinant products, and is useful in 
the study of post-translational modifications 

A variety of sequencing methods are available to obtain the amino acid sequence information. For example, a 
series of chemical reactions, e.g., Edman reactions, or enzymatic reactions, e.g., exo-peptidase reactions, are used 

20 to prepare sequential fragments of the unknown peptide. Either an analysis of the sequential fragments or a sequential 
analysis ofthe removed amino acids is used to determine the linear amino acid sequence of the unknown peptide. 
Typically, the Edman degradation chemistry is used in modem autorrated protein sequencers. 

In the Edman degradation, a peptide is sequenced by degradation from the N-terminus using the Edman reagent, 
phenylisothiocyanate (PITC) The degradation process involves three steps, i e., coupling, cleavage, and conversion 

25 In the coupling step, PITC modifies the N-terminal residue of the peptide, polypeptide, or protein. An acid cleavage 
then cleaves the N-terminal amino acid in the form of an unstable anilinothiazolinone (ATZ) derivative, and leaves the 
peptide with a reactive N-terminus and shortened by one amino acid. The ATZ derivative is converted to a stable 
phenylthiohydantoin in the conversion step for identification, typically with reverse phase high performance liquid chro- 
matography (RP-HPLC). The shortened peptide is left with a free N4erminus that can undergo another cycle of the 

30 degradation reaction. Repetition of the cycle results in the sequential identification of each amino acid in the peptide. 
Because ofthe sequential nature of amino acid release, only one molecular substance can be sequenced at a time. 
Therefore, peptide samples must be extremely pure for accurate and efficient sequencing. Typically, samples must be 
purified with HPLC or SDS-PAGE techniques. 

Although many peptide sequences have been determined by Edman degradation, currently most peptide sequenc- 

35 es are deduced from DNA sequences determined from the corresponding gene or cDN A However, the determination 
of a protein sequence using a DNA sequencing technique requires knowledge of the specific nucleotide sequence 
used to synthesize the protein. DNA sequencing cannot be used where the nature of the protein or the specific DNA 
sequence used to synthesize the protein is unknown. 

A peptide sequence may also be determined from experimental fragmentation spectra of the unknown peptide, 

40 typically obtained using activation or collision-induced fragmentation in a mass spectrometer Tandem mass spectrom- 
etry (MS/MS) techniques have been particularly useful. In MS/MS, a peptide is first purified, and then injected into a 
first mass spectrometer. This first mass spectrometer serves as a selection device, and selects a target peptide of a 
particular molecular mass from a mixture of peptides, and eliminates most contaminants from the analysis. The target 
molecule is then activated or fragmented to form a mixture from the target or parent peptide of various peptides of a 

45 lower mass that are fragments of the parent. The mixture is then selected through a second mass spectrometer (i.e. 
step), generating a fragment spectrum. 

Typically, in the past, the analysis of fragmentation spectra to determine peptide sequences has involved hypoth- 
esizing one or more amino acid sequences based on the fragmentation spectrum. In certain favorable cases, an expert 
researcher can interpret the fragmentation spectra to determine the linear amino acid sequence of an unknown peptide. 

50 The candidate sequences may then be compared with known amino acid sequences in protein sequence libraries. 

In one strategy, the mass of each amino acid is subtracted from the molecular mass of the parent peptide to 
determine the possible molecular mass of a fragment, assuming that each amino acid is in a terminal position. The 
experimental fragment spectrum is then examined to determine if a fragment with such a mass is present A score is 
generated for each amino acid, and the scores are sorted to generate a list of partial sequences for the next subtraction 

55 cycle. The subtraction cycle is repeated until subtraction of the mass of an amino acid leaves a difference of between 
-0 5 and 0.5. resulting in one or more candidate amino acid sequences. The highest scoring candidate sequences are 
then compared to sequences in a library of known protein sequences in an attempt to identify a protein having a sub- 
sequence similar or identical to the candidate sequence that generated the fragment spectrum. 
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Although useful in certain contexts, there are difficulties related to hypothesizing candidate ammo acid sequences 
based on fragmentation spectra. The interpretation of fragmentation spectra is time-consuming, can generally be per- 
formed only in a few laboratones that have extensive experience with mass spectrometry, and is highly technical and 
often inaccurate. Human interpretation is relatively slow, and may be highly subjective. Moreover, methods based on 

5 peptide mass mapping are limited to peptide masses derived from an intact homogeneous peptide generated by spe- 
cific, known proteolytic cleavage, and, thus, are not applicable in general to a mixture of peptides. 

U.S. Patent No. 5,538,897 to Yates, III et al. provides a method of correlating the fragmentation spectrum of an 
unknown peptide with theorotical spectra calculated from doscnbed peptide sequences stored in a database to match 
the amino acid sequence of the unknown peptide to that of a described peptide. Known amino acid sequences, e.g., 

10 in a protein sequence library, are used to calculate or predict one or more candidate fragment spectra. The predicted 
fragment spectra are then compared with the experimentally-obtained fragment spectrum ofthe unknown protein to 
determine the best match or nnatches Preferably, the mass ofthe unknown peptide is known Sub-sequences of the 
various sequences in the protein sequence library are analyzed to identify those sub-sequences corresponding to a 
peptide having a mass equal to or within a given tolerance of the mass of the parent peptide in the fragmentation 

?5 spectrum. For each sub-sequence having the proper mass, a predicted fragment spectrum can be calculated by cal- 
culating masses ofvarious amino acid subsets ofthe candidate peptide. As a result, a plurality of candidate peptides, 
each having a predicted fragment spectrum, is obtained. The predicted fragment spectra are then compared with the 
fragment spectrum obtained experimentally for the unknown protein, to identify one or more proteins having sub-se- 
quences that are likely to be identical to the sequence of peptides that resulted in the experimentally -derived fragment 

20 spectrum. However, this technique cannot be used to derive the sequence of unknown, novel proteins or peptides 
having no sequence or sub-sequence identity with those pre-described or contained in such databases, and, thus, is 
not a de novo sequencing method. 

Therefore, there remains a need for a true de novo sequencing method of determining the amino acid sequence 
of a peptide using mass spectrometry. 

25 

SUMMARY OF THE INVENTION 

The present invention is directed to a method for generating a library of peptides, wherein each peptide in the 
library has a molecular mass corresponding to the same predetermined molecular mass. Typically, the library ofpeptides 

30 is then used to determine the amino acid sequence of an unknown peptide having the predetermined molecular mass. 
Preferably, the predetemnined molecular mass used to generate the library is the molecular mass of the unknown 
peptide. Most preferably, the molecular mass of the unknown peptide is determined prior to the generation of the library 
using a mass spectrometer, such as a time-of-flight mass spectrometer. 

The library is synthetic, i.e., not pre-described, and is typically generated each time a peptide is analyzed, based 

35 on the predetenmined molecular mass of the unknown peptide. The library is generated by defining a set of all allowed 
combinations of amino ackJs that can be present in the unknown peptide, where the molecular mass of each combi- 
nation corresponds to the predetermined molecular mass within the experimental accuracy of the device used to de- 
termine the molecular mass, allowing for water lost in peptide bond formation and for protonation, and generating an 
allowed library of all possible permutations of the linear sequence of amino acids in each combination in the set. 

40 Generally, the present invention is directed to a method for determining the amino acid sequence of an unknown 

peptide, which comprises determining a molecular mass and an experimental fragmentation spectrum for the unknown 
peptide, comparing the experimental fragmentation spectrum of the unknown peptide to theoretical fragmentation spec- 
tra calculated for each individual member of an allowed synthetic peptide library, where the allowed peptide library is 
of the type described above, and identifying a peptide in the peptide library having a theoretical fragmentation spectnjm 

45 that matches most closely the fragmentation spectrum of the unknown peptide, from which it is inferred that the amino 
acid sequence of the identified peptide in the allowed library represents the amino acid sequence of the unknown 
peptide. 

The nnolecular mass for the unknown peptide may be determined by any means known in the art, but is preferably 
determined with a mass spectrometer. Allowed combinations of amino acids are chosen from a set of allowed amino 

50 acids that typically comprises the natural amino acids, i e., tryptophan, arginine, histidine, glutamic acid, glutamtne, 
aspartic acid, leucine, threonine, proline, alanine, tyrosine, phenylalanine, methionine, lysine, asparagine, isoleucine, 
cysteine, valine, serine, and glycine but may also include other amino acids, including, but not limited to, non-natural 
amino acids and chemically modified derivatives ofthe natural amino acids, e.g. , carbamidocysteine and deoxymethio- 
nine. Allowed combinations of amino acids are then calculated using one or more individual members of this set of 

55 amino acids, allowing for known mass changes associated with peptide bond formation, such that the total mass of 
each allowed combination corresponds to the predetermined mass of the unknown peptide to within the experimental 
accuracy to which this molecular mass of the unknown peptide was calculated, typically about 30 ppm. The set of 
allowed combinations is most easily calculated using an appropriately programmed computer The allowed peptide 
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library is assembled by permutation in all possible linear combinations of each allowed amino acid composition, and 
is also most easily constructed using an appropriately programmed computer It should be noted that the term 'allowed' 
with respect to amino acid combinations and libraries of peptides refers to combinations and libraries specific to the 
unknown peptide under investigation. The peptide library is constructed from the amino acid combinations, which in 
5 turn are calculated from the experimentally determined molecular mass. As unknown peptides of different mass are 
investigated, so different combinations of amino acids are allowed, and hence each unknown peptide of unique mo- 
lecular mass gives rise to a unique peptide library. 

Tho prosont invention constrains the allowed library, i.e. limits the number of poeeiblo eoquoncoe. In the broadest 
aspect of the invention, this constraint is achieved by determining a nrolecular mass for the peptide whose sequences 

10 is to be determined, i.e the unknown peptide 

According to preferred embodiments ofthe invention, information, e.g available from the experimental fragmen- 
tation spectrum of the unknown peptide, can be used to put further constraints on the number of possible sequences 
of amino acids in the peptide library. For example, the immonium ton region of the mass spectrum used to determine 
the nrtolecular mass may also be used to identify amino acids contained in the unknown peptide. Altematively or in 

?5 addition, the two N-tenminal amino acids may be identified from the b2/a2 ion pairs. For example, the two N-terminal 
amino acids may be deduced from the prominent signals of the bg and ag ions. In particular, the identity of the signals 
may be determined by recognition of signals separated by 27.98 a.m.u. (corresponding to CO) in the region ofthe 
spectrum which includes the mass of all possible combinations of modified and unnnodified amino acids. Further, based 
on the use of enzyme treatment, e.g. with a protease such as papain, chymotrypsin or trypsin, the C-terminal residue 

20 of any peptide in the spectrum is determined as either arginine or lysine, and this may be confirmed or identified from 
the recognition of signals at 175.11 and 147.11 respectively. Altematively, C-terminals containing basic amino acids 
can be identified by recognition ofthe predicted y^ ion. The spectrum can be interpreted to identify the next amino acids. 

Another means of applying a constraint on the allowed library of amino acids is to obtain partial internal sequence 
information, e g by identifying the y series ofions with appropriate defined accuracy of mass measurement. In particular, 

25 a computer programme may be used to recognise at least three sequential signals separated by the nnass of all possible 
modified and unmodified amino acid residues. The differences between these signals allows identification of a se- 
quence of two amino acids. Most preferably, the molecular mass of the unknown peptide and at least one other exper- 
imental parameter e.g. as given above, are used as constraints in initially generating the library of allowed peptides. 
The nature ofthe fragmentatbn process from which the theoretical fragmentatbn spectrum is calculated for every 

30 peptide in the allowed library may be of any type known in the art, such as a mass spectrum or a protease or chemical 
fragmentation spectrum. Preferably, both the molecular mass and the fragmentation spectrum for the unknown peptide 
are obtained from a tandem mass spectrometer The amino acid sequence of the peptide from the allowed library of 
peptides, having a calculated fragmentation spectrum that best fits the experimental fragmentation spectrum of the 
unknown peptide, corresponds to the ammo acid sequence of the unknown peptide. 

3S Although not required, the experimental fragmentation spectrum is generally normalized. A factor that is an indi- 

cation of closeness-of-fit between the experimental fragmentation spectrum of the unknown peptide and each of the 
theoretical fragmentation spectra calculated for the peptide library may then be calculated to determine which of the 
theoretical fragmentation spectra best fits the experimental fragmentation spectrum. Preferably, peak values in the 
fragmentation spectra having an intensity greater than a predetermined threshold value are selected when calculating 

40 the indication of closeness-of-fit. The theoretical fragmentatbn spectrum that best fits the experimental fragmentation 
spectrum corresponds to the amino acb sequence in the allowed library that matches that of the unknown peptide. 

BRIEF DESCRIPTION OF THE DRAWINGS 

45 Fig 1 is a flow chart of the method of the invention. 

Figs 2a and 2b are flow charts of alternative preferred embodiments of the invention. 
Fig. 3 is the experimental mass spectrum used to determine the molecular mass of unknown Peptide X. 
Fig 4 is the immonium ion region of the mass spectrum shown in Fig 3, and identifies amino acids contained in 
unknown Peptide X. 

50 Fig 5 is the experimental fragmentation mass spectrum of Peptide X. 

Fig 6 is the experimental mass spectrum used to determine the molecular mass of a Peptide Y. 
Fig 7 is the immonium ion region of the mass spectrum shown in Fig 6, and identifies amino acids contained in 
Peptide Y. 

Fig 8 is the expehmental tandem mass spectrum of Peptide Y. 
55 Fig 9 is the experimental mass spectrum used to determine the molecular mass of a Peptide Z. 

Fig. 10 is the immonium ion region of the mass spectrum shown in Fig. 9, and identifies amino acids contained in 
Peptide Z. 

Fig 11 is the experimental tandem mass spectrum of Peptide Z. 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed to a de novo method for determining the sequence of an unknown peptide without 
reference to any experimentally determined peptide or nucleotide sequence, and without recourse to a sequential and 

5 Step-wise identification and ordering of individual amino acid residues, such as the Edrrian degradation process or 
interpretation of conventional mass spectrometry fragmentation patterns. In the method of the invention, a library of 
theoretical peptide sequences is generated from a predetermined molecular nrass, preferably the experimentally de- 
termined molGcular mass of an unknown peptide. As such, this library must contain the amino acid sequence of the 
unknown peptide, as well as that of any other peptide having the predetermined molecular mass. The precise amino 

TO acid sequence of the unknown is identified by applying standard correlation functions to select that peptide from the 
synthetic library whose calculated, i.e., theoretical, fragmentation spectrum most closely matches the fragmentation 
pattem of the unknown In the preferred embodiment, the fragmentation spectrum is a mass spectrum and the corre- 
lation method is the function described in U.S. Patent No. 5,538,897, the contents of which are incorporated herein in 
their entirety by reference. Preferably, the theoretical fragmentation spectra are generated and matched to the frag- 

/5 mentation pattern of the unknown using an appropriately programmed computer 

The inventbn may be better understood by reference to the flow chart provided in Fig. 1 . Where the peptide is a 
protein or large polypeptide, the protein or large polypeptide may be cleaved to form a peptide pool by means well 
known in the art. The unknown peptide ('Peptide X") is then separated from the pool by HPLC or any other means 
known in the art, preferably mass spectrometry, and the molecular mass otPeptide X is determined. Although there 

20 are a number of methods for determining the nx>lecular mass of PeptkJe X, the preferred method is again mass spec- 
trometry. 

A set of amino acids that theory or experimental results teach may be included in Peptide X is then defined for 
consideration in determining the sequence of Peptide X. The defined set of amino acids may include modified or 
unnatural amino acids in addition to natural amino acids. 

2S Typically, the method of the invention requires a 'naked" peptide when determining the amino ackJ sequence. 

Therefore, the peptide should be free of any individual amino acids that are covalently modified by post-translational 
modification, such as. e.g., glycosylation, which involves the attachment of carbohydrate to the side chain of certain 
amino acids. Where the method of the invention is used to determine the amino acid sequence of a post-translationally 
modified peptide, the modifications are typically removed from the peptide prior to the analysis, taking due care to 

30 leave the peptide intact. Methods for removing post-translational modifications from peptides are well known in the art, 
and include, for example, the removal of N-linked carbohydrates with enzymes, such as peptlde-N-glycosidase F (PN- 
Gase F), endo-glycosidases, mixtures of exo-glycosidases, etc., and the removal of phosphate modification with phos- 
phatases. In additbn, other techniques for removing modificatbns occasionally found on peptides are well known in 
the art. However, where a specific modification to a specific amino acid is known to be present in the unknown peptide, 

35 the nxxJified amino acid may be included in the defined set of amino acids that theory or experimental results teach 
may be included in Peptide X, and, thus, the sequence of the peptide containing the modified peptide may be determined 
with the method of the present invention. 

All combinations of amino acids having a total mass equal to the measured mass of Peptide X are calculated, 
allowing for water lost in determining peptide links, protonation, etc. Any individual amino acid may be included as part 

"^0 of any given combination at any integral stoichrametry up to the amount consistent with the mass detenmined for Peptide 
X. These combinations comprise all ofthe allowed combinations of amino acids combinations for Peptide X, and, there- 
fore, the actual amino acid compositions of Peptide X will be represented in one and only one of these combinations. 
Furthermore, these combinations are generally peptide-specific. 

An allowed library of linear peptides is then constructed from the allowed combinations of amino acids. The allowed 

^5 library is constructed by generating all possible linear permutations of the sequence of ammo acids in each combination, 
using all the amino acids in each combination. The allowed library comprises all such permutations of the amino acids, 
and therefore must include Peptide X. The altowed library of peptides having the same molecular mass as Peptide X 
is typically constructed independently and ab initio for each new unknown peptide that is sequenced. That is, a new 
library is typically constructed as part of each analysis, and for only that analysis. However, as will be clear to one of 

50 ordinary skill in the art, once a library of all peptides having a given molecular mass has been constructed, that library 
may be used for the determination of the amino acid sequence of any other peptde of that particular molecular mass. 

This differs fundamentally from existing data base approaches in which a single data base of known sequences, 
which is subject to periodic updates and refinements based on the availability of experimentally determined sequences, 
is used for all analyses. As a result, with the method of the present invention, the determination of new and previously 

55 unknown peptides sequences that are not present in any experimentally determined peptide sequence library is pos- 
sible by direct peptide analysis in a non-step-wise, operator-independent automated process. In addition, the method 
ofthe invention is not constrained to the conventional twenty amino acids, or to their conventional modifications. 
In a preferred embodiment, as shown in the flow chart provided in Fig. 2a, additional information relating to Peptide 
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X is used to place constraints on the allowed combinations of ammo acids and/or allowed peptide sequences in the 
library, and, thus, reduce the number of possible sequences. Useful information related to Peptide X includes, but is 
not limited to, partial ammo acid composition. For example, the mass spectrum used to determine the mass of Peptide 
X may include fragments that can be used to identify specific amino acids present In Peptide X. Where it is known that 
5 certain amino acids are definitely present in Peptide X, constraints are placed on the allowed combinations and allowed 
library by requiring the identified amino acids to be present in all combinations and, thus, in every peptide present in 
the library. 

Fig. 2b illustrates a system whereby more than one constraint is put on the library of possible linear sequences. 
By way of illustration only, for each peptide to be analysed (whether purified or present in a mixture), information on 
10 its mass (e.g. by MALDI-MS) and a tandem mass spectrum from it (e.g. by FSI-tandem MS) are obtained. The tandem 
mass spectrum can then be interpreted in an automated manner, to obtain certain information about the unknown 
peptide. Suitable software evaluates the following information, when possible, from a tandem MS spectrum in an au- 
tomated manner: 

75 i. Information on amino acids contained in the peptide by analysing the immontum ions region, 

ii. Identification of the two N-terminal amino acids by identifying the b2/a2 ion pairs. 

ill. Based on the use of trypsin, the C-terminal residue must be lysine or arginine. These can be identified in the 
spectrum and the spectrum interpreted to find the next amino acids. 

iv. Partial Internal sequence information can be obtained by identifying y series of ions with defined accuracy of 
20 mass measurement at < 100 ppm. 

A discussion of manual spectrum interpretation is provided in Medzihradsky and Burlingame, A Companion to 
Methods in Enzymology 6: 284-303 (1994). 

Again with reference to Figs 1 and 2, the allowed library, which has preferably been constrained, is then used as 

2S the basis for generating theoretical fragmentation patterns that are compared to the experimental fragmentation pattern 
obtained for Peptide X. The fragmentation patterns may be obtained by any suitable means known in the art. Preferably, 
the fragmentation patterns are mass spectra, and the method used to match the theoretical and experimental mass 
spectra is that disclosed in U.S. Patent No. 5,538,897. However, protease or chemical fragmentation, coupled to HPLC 
separation of the fragments, may also be used to obtain the experimental fragmentation patterns. 

30 Preferably, in a determination of the amino acid sequence of Peptide X, the molecular mass ofPeplide X is deter- 

mined with high accuracy, typically, to within about 30 ppm (parts per million). An example of such a spectrum is 
provided in Fig 3, where the molecular mass of Peptide X is determined from the peak at 774 3928 daltons In addition, 
as a result of the partial fragmentation of Peptide X that can occur, fragments that identify certain amino acids that are 
contained in Peptide X are also observed, allowing the peptide library to be constrained. An example of this portion of 

3S the mass spectrum for Peptide X is provided in Fig. 4. 

Peptide X is then subjected to collision-induced dissociation in a mass spectrometer. The parent peptide and its 
fragments are then introduced into the second mass spectrometer that provides an intensity or count and the mass to 
charge ratio, m/z, for each of the fragments in the fragment mixture. Each fragment ion is represented in a bar graph 
in which the abscissa value is m/z and the ordinate value is the intensity A variety of mass spectrometer types can be 

■40 used, including, but not limited to, triple-quadrapole mass spectrometry, Fourier-transform cyclotron resonance mass 
spectrometry, tandem time-of-flight mass spectrometry, and quadrapole ion trap mass spectrometry. 

The experimental fragment spectrum is then compared to the mass spectra predicted for the sequences ofthe 
allowed library, to identify one or more predicted mass spectra that closely match the experimental mass spectrum. 
Because the allowed library includes all permutations of amino acid sequences that have a total mass corresponding 

45 to that of Peptide X. Peptide X must be represented in the allowed library. 

The predicted fragmentation spectra may be obtained and compared to the experimental fragmentation spectrum 
by employing a method that involves first normalizing the experimental fragmentation spectrum. This may be accom- 
plished by converting the experimental fragmentation spectrum to a list of nriasses and intensities. The peak values 
for Peptide X are removed, and the square root of the remaining intensity values Is calculated, and normalized to a 

50 maximum value of 1 00. The 200 most intense ions are divided into ten mass regions, and the maximum intensity within 
each region is again normalized to 100. Each ion within 3.0 daltons of its neighbour on either side is given an intensity 
value equal to the greater of the intensity of the ion or that of its neighbour Other normalization methods can be used, 
and it is possible to perform the analysis without normalizing. However in general, normalization is preferred. In par- 
ticular, maximum normalized values, the number of intense ions, the number of mass regions, and the size of the 

55 window for assuming the intensity value of a near neighbour may all be independently varied to larger or smaller values. 

A fragment mass spectrum is predicted for each of the candidate sequences. The fragment mass spectrum is 
predicted by calculating the fragment ion n^asses for the type b and y ions for the amino acid sequence When a peptide 
is fragmented and the charge is retained on the N-terminal cleavage fragment, the resulting ion is labelled as a b-type 
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ion. It the charge is retained on the c-type terminal fragment, it is labelled a y-type ion. Masses for type b ions were 
calculated by summing the amino acid masses and adding the mass of a proton Masses for type y ions were calculated 
by summing, from the c-terminus, the masses of the amino acids and adding the mass of water and a proton to the 
initial amino acid. In this way, it is possible to calculate an m/z value for each fragment 

s However, in order to provide a predicted mass spectrum, It is also necessary to assign an intensity value for each 

fragment. Although it is often possible to predict, on a theoretical basis, an intensity value for each fragment, this 
procedure is difficult, and it has been found useful to assign intensities in the following fashion. The value of 50.0 is 
assigned to each b and y ion. To massos of 1 dalton on either side of the fragment ion, an intensity of 25 0 is assigned. 
Peak intensities of 10 0 are assigned at mass peaks 17.0 and 18 0 dattons below the nVz of each b and y ion location, 

10 to account for both NH3 and H2O loss, and peak Intensities of 10 0 are assigned to mass peaks 28.0 daltons below 
each type b ion location, to account for CO loss. 

After calculation of predicted m/z values and assignment of intensities, it is preferred to calculate a measure of 
closeness-of-fit between the predicted mass spectra and the experimentally-derived fragment spectrum. A number of 
methods for calculating closeness-of-fit are available. For example, a two-step method may be used that includes 

fs calculating a preliminary closeness-of-fit score, referred to here as Sp, and calculating a correlation function for the 
highest-scoring amino acid sequences. In the preferred embodiment, Sp is calculated using the following formula: 

Sp3(Ii^)*n;(1+Pr(1-p)/n^ (1) 

20 

where i^ are the matched intensities, n, are the number of matched fragment ions, p is the type b and y ion continuity, 
p is the presence of immonium bns and their respective amino acids in the predicted sequence, and r\ is the total 
number of fragment ions. The factor p evaluates the continuity ofa fragment ion series. If there is a fragment ion match 
for the ion immediately preceding the current type b or y ion, p is Incremented by 0.075 from an Initial value of 0.0. 

25 This increases the preliminary score tor those peptides matching a successive series of type b and y ions, since ex- 
tended series of ions of the same type are often observed in MS/MS spectra. The factor p evaluates the presence of 
immonium ions in the low mass end of the mass spectrum. 

The detection of immonium ions may be used diagnostcally to determine the presence of certain types of amino 
acids in the sequence. For example, ifimmonium ions are present at 110.0, 120.0, or 136.0 + 1.0 daltons in the proc- 

30 essed data file of the unknown peptide with normalized intensities greater than 40.0, indicating the presence of histidine, 
phenylalanine, and tyrosine respectively, then the sequence under evaluation is checked tor the presence of the amino 
acid indicated by the immonium ion The preliminary score, Sp, for the peptide is either increased or decreased by a 
factor of 1 -p, where p is the sum of the penalties for each of the three amino acids whose presence is indicated in the 
low mass region. Each individual p can take on the value of -0.15 ifthere is a corresponding low mass peak, and the 

55 amino acid is not present in the sequence, +0.15 if there is a corresponding low mass peak and the amino acid is 
present in the sequence, or 0.0 if the low mass peak is not present. The total penalty can range from -0.45, where all 
three low mass peaks are present in the spectrum, but are not present in the sequence, to +0.45, where all three low 
mass peaks are present in the spectrum, and are present in the sequence. 

Following the calculation of the preliminary do sen ess of -fit score. Sp, the predicted mass spectra having the highest 
Sp scores are selected for further analysis using the correlation function. The number of candidate predicted mass 
spectra that are selected for further analysis will depend largely on the computational resources and time available. 

For purposes of cateulating the correlation function, the experimentally-derived fragment spectrum is typically pre- 
processed in a fashion somewhat different from preprocessing employed before calculating Sp. For purposes of the 
correlation function, the precursor ion is removed from the spectrum, and the spectrum is divided into 10 sections. Ions 
in each section are then normalized to 50 0. The sectbn-wise normalized spectra are then used for calculating the 
correlation function. The discrete correlation between the two functions may be calculated as: 



Rx = Lxori + T (2) 



where T is a lag value. The discrete correlation theorem slates that the discrete correlation of two real functions x and 
y is one member of the discrete Fourier transform pair 
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^ \ Y*T (3) 

where X(t) and Y{t) are the discrete Fourier transforms of x{l) and y(i), and the Y* denotes complex conjugation. There- 
^ tore, the cross-correlations can be computed by Fourier transtomiatlon of the two data sets using the fast Fourier 
transform (FFT) algorithm, multiplication of one transform by the complex conjugate of the other, and inverse transfor- 
mation of the resulting product. 

The predicted spectra as well as the pre-processed unknown spectrum may be zero-padded to 4096 data points, 
since the MS/MS spectra are not periodic, as intended by the correlation theorem, and the FFT algorithm requires N 
to be a integer power of two, so the resulting end effects need to be considered. The final score attributed to each 
candidate peptide sequence is R(0) minus the mean ofthe cross-correlation function over the range -75<t<75. This 
modified "correlation parameter", described in Powell and Heiftje, Anal. Chim. Acta, 100:313-327 (1978), shows better 
discrimination over just the spectral correlation coefficient R(0). The raw scores are normalized to 1 .0. Preferably, the 
output includes the normalized raw score, the candidate peptide mass, the unnormalized correlation coefficient, the 
preliminary score, the fragment ion continuity p, the immonium ion factor t, the number of type b and y ions matched 
out of the total number of fragment ions, their matched intensities, the protein accession number, and the candidate 
peptide sequence. 

The correlatbn function can be used to select automatically one of the predicted mass spectra as corresponding 
to the experimentally-derived fragment spectrum. Preferably, however, a number of sequences from the library are 
output and final selection of a single sequence is done by a skilled operator 

Depending on the computing and time resources available, it may be advantageous to employ data-reduction 
techniques. Preferably, these techniques will emphasize the most informative ions in the spectrum while not unduly 
affecting search speed. One technique involves considering only some of the fragment ions in the MS/MS spectrum, 
which, for a peptide, may contain as many as 3,000 fragment ions. According to one data reduction strategy, the ions 
are ranked by intensity, and some fraction ofthe most intense ions is used for comparison. Another approach involves 
subdividing the spectrum into a small number ofregions, e.g., about 5, and using the 50 most intense ions in each 
region as part ofthe data set. Yet another approach involves selecting ions based on the probability ofthose ions being 
sequence ions. For example, ions could be selected which exist in mass windows of 57 through 186 daltons, i.e., the 
range of mass increments for the 20 common amino acids from glycine to tryptophan that contain diagnostic features 
oftype b or y ions, such as losses of 17 or 18 daltons, corresponding to ammonia and water, or a loss of 28 daltons, 
corresponding to CO , 

A number of different scoring algorithms can be used for determining preliminary closeness-of-fit or correlation. 
In addition to scoring based on the number of matched Ions multiplied by the sum of the intensity, scoring can be based 
on the percentage of continuous sequence coverage represented by the sequence ions in the spectrum For example, 
a 10 residue peptide will potentially contain 9 each of b and y type sequence ions. If a set of ions extends from to 
Bg, then a score of 100 is awarded, but if a discontinuity is observed in the middle ofthe sequence, such as a missing 
B5 ion, a penalty is assessed. The maximum score is awarded for an amino acid sequence that contains a continuous 
ion series in both the b and y directions. 

In the event that the described scoring procedures do not delineate an answer, an additional technique for spectral 
comparison can be used in which the database is Initially searched with a molecular weight value and a reduced set 
of fragment Ions. Initial filtering of the database occurs by matching sequence Ions, and generating a score with one 
of the methods described above. The resulting set of answers will then undergo a more rigorous inspection process 
using a modified full MS/MS spectrum. 

For the second stage analysis, one of several spectral notching approaches developed for spectral library search- 
ing is used This will require generating a "library spectrum" for the peptide sequence, based on the sequence ions 
predicted for that amino acid sequence. Intensity values for sequence ions of the "library spectrum" will be obtained 
from the experimental spectrum. If a fragment ion Is predicted at m/z 256, then the Intensity value for the Ion In the 
experimental spectrum at m/z 256 will be used as the intensity of the ion in the predicted spectrum Thus, if the predicted 
spectrum Is identical to the "unknown" spectrum, it will represent an ideal spectrum. The spectra will then be compared 
using a correlation function. In general, it is believed that the majority of computational time for the above procedure 
is spent in the iterative search process. By multiplexing the analysis of multiple MS/MS spectra in one pass through 
the database, an overall improvement in efficiency will be realized. In addition, the mass tolerance used in the initial 
pre-filtering can affect search times by increasing or decreasing the number of sequences to analyze In subsequent 
steps 

Another approach to speeding up searches Involves a binary encryptbn scheme where the mass spectrum is 
encoded as peak/no peak at every mass depending on whether the peak is above a certain threshold value. Ifintensive 
use ofa protein sequence library is contemplated, it may be possible to calculate and store predicted mass values of 
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all sub-sequences wrthin a predetermined range of masses so that at least some of the analysis can be performed by 
table look-up rather than calculation. 

EXAMPLES 

5 

The following non-limiting examples are merely illustrative of the preferred embodiments of the present invention, 
and are not to be construed as limiting the invention, the scope of which is defined by the appended claims. 

EXAMPLE 1. 

10 

The amino acid sequence of unknown Peptide X was determined using the method of the invention. The molecular 
mass of Peptide X was first determined using a matrix-assisted laser-description time-of-flight mass spectrometer 
(Voyager Elite, manufactured by Perseptive Biosystems) with delayed extraction and post source decay As shown in 
Fig. 3, the mass of the protonated form of Peptide X form is 774.3928 daltons, which indicates a mass of 773.3928 
15 daltons for Peptide X. 

The set of amino acids that are possibly part of Peptide X were then defined for consideration in the analysis. The 
defined set of amino acids with the molecular mass of each amino acid less the mass of the one water molecule lost 
during peptide bond formation is provided bebw. The molecular masses are given in daltons or a.m.u. 



20 



25 



tryptophan = 


186.079313 


carbamido cysteine = 


160.03065 


arginine ~ 


156.10111 


phenylalanine = 


147.068414 


histidine = 


137.058912 


methionine = 


131.04085 


glutamic acid = 


129.042593 


lysine = 


128.094963 


glutamine - 


128.058577 


asparagine = 


114.042927 


aspartic acid = 


115.026943 


isoleucine = 


113,084064 


leucine - 


113.084064 


cysteine = 


103.009185 


threonine = 


101.047678 


valine = 


99 068414 


proline = 


97.052764 


serine = 


87.032028 


alanine = 


71.037114 


glycine - 


57.021464 


tyrosine - 


163.063328 







The allowed combinations of amino acids for Peptide X were determined by first determining the molecular mass 
of Peptide X, as described above, to an experimental accuracy of 30 ppm (parts per million). Therefore, each allowed 
combination of ammo acids in the allowed library must have a total mass of 773.3928 ± 30 ppm. In addition to providing 
the molecular mass of Peptide X, the first mass spectrum also confirmed the presence of certain amino acids in Peptide 
X. The immonium region of this mass spectrum, which shows the presence ofthese amino acids, is given in Fig. 4. In 
particular, the immonium region of the spectrum indicates the presence of arginine with a characteristic mass of 
174.988, leucine/isoleucine with a characteristk; mass at 85.8851 (these amino acids have the same nr^ss, and are 
therefore not distinguishable by mass alone), histidine with a characteristic mass at 109.823. and tyrosine with a char- 
acteristic mass at 1 35.91 5. Therefore, it was possible to constrain the allowed library to sets containing arginine, leucine/ 
isoleucine, histidine, and tyrosine, having a total molecular mass of 773.3928 ± 30 ppm. 

To determine the sets of amino acids that have a total molecular mass of 773.3928 ± 30 ppm, the following equation 
was applied: 

MMj^ ^ Z (histidine) + (tyrosine) + (leucine/isoleucine) + 
(arginine) + (HgO) + (aa,) -»- -—- »- (aa^), 

so 

where aa, aa„ are any of the allowed amino acids, other than arginine, isoleucine, histidine, and tyrosine. The only 
combinations of amino acids that can have a total molecular mass of 773 3928 + 30 ppm are as foltows: 

1 ) tryptophan, arginine, leucine/isoleucine, histidine, and tyrosine. 

2) glutamic acid, glycine, arginine, leucine/isoleucine, histidine, and tyrosine. 

3) alanine, aspartic acid, arginine, leucine/isoleucine, histidine, and tyrosine 
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These combinations constitute the allowed sets of amino acids tor Peptide X. 

In addition, Peptide X was obtained by a tryptic cleavage and, therefore, from the accepted specificity of trypsin, 
Peptide X must also have lysine or arginine as its carboxy terminal amino acid. With this constraint, the allowed library 
of linear peptides was constructed from all individual linear permutations of combinations 1, 2, and 3. The allowed 
library includes 528 linear peptides, one set of 264 peptides containing isoleucine (shown below) and a corresponding 
set of 264 peptides in which isoleucine is replaced by leucine (not shown), 



1) YIHWn 


54) HIYGER 


107) 


HGYEIR 


160) DYHIAR 


213) 


lADYHR 


2) lYHWR 


55) YIGHER 


108) 


GHYEIR 


161) HDYIAR 


214) 


AIDYHR 


3) YHIWR 


56) lYGHER 


109] 


YEGHIR 


162) DHYIAR 


215) 


DAIYHR 


4) HYIWR 


57) YGIHER 


iio; 


EYGHIR 


163) IHDYAR 


216) 


ADIYHR 


5) IHYWR 


58) GYIHER 


111) 


YGEHIR 


164) HIDYAR 


217) 


YHDAIR 


6) HIYWR 


59) IGYHER 


112) 


GYEHIR 


165) IDHYAR 


218) 


H YD AIR 


7) YIWHR 


60) GIYHER 


113) 


EGYHIR 


166) DIHYAR 


219) 


YDHAIR 


8) lYWHR 


61) YHGIER 


114) 


GEYHIR 


167) HDIYAR 


220) 


DYHAIR 


9) YWIHR 


62) HYGIER 


115) 


HEGYIR 


168) DHIYAR 


221) 


HDYAiR 


10) WYIHR 


63) YGHIER 


116) 


EHGYIR 


169) YIHADR 


222) 


DHYAIR 


11) IWYHR 


64) GYHIER 


117) 


HGEYIR 


170) lYHADR 


223) 


YHADIR 


12) WIYHR 


65) HGYIER 


118) 


GHEYIR 


171) YHIADR 


224) 


HYADIR 


13) YHWIR 


66) GHYIER 


119) 


EGHYIR 


172) HYIADR 


225) 


YAHDiR 


14) HYWIR 


67) IHGYER 


120) 


GEHYIR 


173) IHYADR 


226) 


AYHDIR 


15) YWHIR 


68) HIGYER 


121) 


IHEGYR 


174) HIYADR 


227) 


HAYDIR 


16) WYHIR 


69) IGHYER 


122) 


HIEGYR 


175) YIAHDR 


228) 


AHYDIR 


17) HWYIR 


70) GIHYER 


123) 


lEHGYR 


176) lYAHDR 


229) 


YDAHIR 


18) WHYIR 


71) HGIYER 


124) 


EIHGYR 


177) YAIHDR 


230) 


DYAHIR 


19) IHWYR 


72)GHIYER 


125) 


HEIGYR 


178) AYIHDR 


231) 


YADHIR 


20) HIWYR 


73) YIEGHR 


126) 


EHIGYR 


179) lAYHDR 


232) 


AYDHIR 


21) IWHYR 


74) lYEGHR 


127) 


IHGEYR 


180) AIYHDR 


233) 


DAYHIR 


22) WIHYR 


75) YEIGHR 


128) 


HIGEYR 


181) YHAIDR 


234) 


ADYHIR 


23) HWIYR 


76) EYIGHR 


129) 


IGHEYR 


182) HYAIDR 


23 5 


) HDAYIR 


24) WHIYR 


77) lEYGHR 


130) 


GIHEYR 


163) YAHIDR 


236) 


DHAYIR 


25)YIHEGR 


78) ElYGHR 


131) 


HGIEYR 


184) AYHIDR 


237) 


HADYIR 


26) lYHEGR 


79) YIGEHR 


132) 


GHIEYR 


185) HAYIDR 


238) 


AHDYIR 


27) YHIEGR 


80) lYGEHR 


133) 


lEGHYR 


186) AHYIDR 


239) 


DAHYIR 


28) HYEGR 


81) YGIEHR 


134) 


EIGHYR 


187) IHAYDR 


240) 


ADHYIR 


29) IHYEGR 


B2) GYIEHR 


135) 


IGEHYR 


188) HIAYDR 


241) 


IHDAYR 


30) HIYEGR 


83) IGYEHR 


136) 


GIEHYR 


189) lAHYDR 


242) 


HIDAYR 


31) YIEHGR 


84) GIYEHR 


137) 


EGIHYR 


190) AIHYDR 


243) 


IDHAYR 


32) lYEHGR 


85) YEGIHR 


138) 


GEIHYR 


191) HAIYDR 


244) 


DIHAYR 


33) YEIHGR 


86) EYGIHR 


139) 


HEGIYR 


192) AHIYDR 


245) 


HDIAYR 


34) EYIHGR 


87) YGEIHR 


140) 


EHGIYR 


193) YIDAHR 


246) 


DHIAYR 


35)IEYHGR 


88) GYEIHR 


141) 


HGEIYR 


194) lYDAHR 


247) 


IHADYR 


36) EIYHGR 


89) EGYIHR 


142) 


GHEIYR 


195) YDIAHR 


248) 


HIADYR 


37) YHEIGR 


90)GEYIHR 


143) 


EGHIYR 


196) DYIAHR 


249) 


lAHDYR 


38) HYEIGR 


91) lEGYHR 


144) 


GEHIYR 


197) IDYAHR 


260) 


AIHDYR 


39) YEHIGR 


92) EIGYHR 


145) 


YIHDAR 


198) DIYAHR 


251) 


HAIDYR 


40) EYHIGR 


93) IGEYHR 


146) 


lYHDAR 


199) YIADHR 


252) 


AHIDYR 


41)HEYIGR 


94)GIEYHR 


147) 


YHIDAR 


200) lYADHR 


253) 


IDAHYR 


42) EHYIGR 


55) EGIYHR 


148) 


HYIDAR 


201) YAIDHR 


254) 


DIAHYR 


43)IHEYGR 


96)GE!YHR 


149) 


IHYDAR 


202) AYIDHR 


255) 


lADHYR 


44) HIEYGR 


97) YHEGIR 


150) 


HIYDAR 


203) lAYDHR 


256) 


AIDHYR 


45)IEHYGR 


98) HYEGIR 


151) 


YIDHAR 


204) AIYDHR 


257) 


DAIHYR 


46) EIHYGR 


99) YEHGIR 


152) 


lYDHAR 


205) YDAIHR 


258) 


ADIHYR 
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(continued) 



47) HEIYGR 


100) EYHGIR 


153) YDIHAR 


206) DYAIHR 


259)HDAIYR 


48) EHIYGR 


101) HEYGIR 


154) DYIHAR 


207) YADIHR 


260) HADIYR 


49) YIHGER 


102) EHYGIR 


155) IDYHAR 


208) AYDIHR 


261) HADIYR 


50) lYHGER 


103) YHGEIR 


156) DIYHAR 


209) DAYIHR 


262) AHDIYR 


51) YHIGER 


104) HYGEIR 


157) YHDIAR 


210) ADYIHR 


263) DAHIYR 


52) HYIQER 


105) YGHEIR 


158) HYDIAR 


211) IDAYHR 


264) ADHIYR 


53) IHYGER 


106) GYHEIR 


159) YDHIAR 


212) DIAYHR 





The method o1 U.S. Patent No. 5,538,897 was then used to nnatch Peptide X to this library by MS/MS. The exper- 
innental tandenn nnass spectrum of Peptide X is shown in Fig. 5, and the 10 top ranking peptides nnatched to this 
spectrum are provided below. It was determined that the sequence of Peptide X is that of the top ranked peptide, 
AHYDIR. 



25 



Rank/Sp 


(M+H) 


Cn 


0*1 OM 


Sp 


Ions 


Reference 


Peptide 


1/1 


774.9 


1 .0000 


1.8118 


491 0 


11/15 


p(228) 


(-)AHYDIR 


2/3 


774.9 


0.9308 


1 6864 


386 2 


10/15 


p(23 8) 


(-)AHDYIR 


3/2 


774.9 


0.8012 


1 4516 


414 3 


10/15 


P(227) 


(-)HAYDtR 


4/5 


774.9 


0.7319 


1 3262 


320 5 


9/15 


p(23 7) 


(-)HADYIR 


5/1 


774.9 


0.7168 


1.2987 


491 0 


11/15 


p(186) 


(-)AHYIDR 


6/12 


774.9 


0.6131 


1 1108 


248 3 


9/15 


p(226) 


(-)AYHDIR 


7/3 


774.9 


0.6033 


1.0930 


386 2 


10/15 


p(192) 


(-)AHIYDR 


S/9 


774.9 


0.5878 


1 0651 


264 1 


9/15 


p(225) 


(-)YAHDIR 


9/50 


774.9 


0.5850 


1 0599 


156 5 


7/15 


p(219) 


(-)YDHAIR 


10/14 


774.9 


0.5825 


1.0553 


247.9 


9/15 


p(217) 


(-)YHDAIR 



EXAMPLE 2. 



The amino acid sequence of Peptide Y, a known, standard peptide, was determined using the method of the 
invention, as applied to Peptide X in Example 1. Peptide Y has the following amino acid sequence: YGGFIRR. The 
molecular mass of Peptide Y was determined to be 868.4719 to an experimental accuracy of 30 ppm from the mass 
spectrum shown in Pig, 6. The masses at 1296.6854 and 1570.6774 are from internal standards, added to albw in- 
strument calibration. 

The set of amino acids that are possibly part of Peptide Y were then defined for consideration in the analysis. The 
defined set of amino acids with the molecular mass of each amino acid less the mass of the one water molecule lost 
during peptide bind formation is the same as those used in Example 1 . 

As the mass of Peptide Y was measured as 868.4719 to an experimental accuracy of ± 30 ppm, each allowed 
amino acid combination must therefore have a total mass equal to 868.471 9 ± 30 ppm. In addition, from the immonium 
ion region of the PSD trace from Fig. 6, shown in Fig. 7, it was detemnined that Peptide Y must also contain the following 
amino acids tyrosine with a characteristic mass at 136.027, phenylalanine with a characteristic mass at 120.071. 
arginine with a characteristic mass at 175.00, and leucine or isoleucine with a characteristic mass at 85.9225. 

Application of the equation in Example 1 demonstrated that only the following combinations of amino acids are 
allowed for Peptide Y: 



1) Tyrosine, phenylalanine, arginine, asparagine, and arginine. 

2) Tyrosine, phenylalanine, arginine, arginine, leucine/isoleucine, glycine, and glycine. 

3) Tyrosine, phenylalanine, arginine, leucineAsoleucine, alanine, alanine, and glutamine. 

4) Tyrosine, phenylalanine, arginine, leucine/isoleucine, glycine, valine, and asparagine 

5) Tyrosine, phenylalanine, arginine, leucine/isoleucine, glycine, glycine, glycine, and valine 

6) Tyrosine, phenylalanine, arginine, leucine/isoleucine, glycine, alanine, alanine, and alanine. 

These combinations constitute the allowed set of amino acid combinations for Peptide Y 

In addition. Peptide Y was obtained by a tryplic cleavage, and, thus, from the accepted specificity oftrypsin, Peptide 
Y must also have lysine or arginine as its carboxy terminal ammo acid. With this constraint, the allowed library of linear 



11 



EP0 887 646 A1 



peptides for Peptide Y is constructed from all individual linear permutations of the combinations above. The allowed 
library includes over 20,000 peptides, and is thus not shown 

As with Example 1. the method of U.S. Patent No. 5,538,897 was then used to match Peptide Y to this library by 
tandem mass spectrometry. The experimental tandem mass spectrum of Peptide Y is shown in Fig. 8, and the top 10 
5 ranking peptides matched to this spectrum are given below. Of these ten, the top ranking peptide, YGGFIRR is known 
to be Peptide Y 



70 



20 



Rank/Sp 


(M+H) 


Cn 


CM 


Sp 


Ions 


Reference 


Peptide 


1/3 


868. 


51.000 


1.894 


3766 


11/24 


P(415) 


(-)YGGFFIR 


2/1 


868.5 


0.967 


1.831 


440.4 


11/24 


p(298) 


(-)YGGRIFR 


3/15 


868.5 


0.966 


1.830 


322.8 


11/28 


p(1975) 


(-)YGGFIGVR 


4/15 


868 5 


0.965 


1.828 


322.8 


11/28 


p(1735) 


(-)YGGFIVGR 


5/5 


868 5 


0 961 


1.821 


361.7 


11/24 


p(454) 


(-)YGGRFIR 


6/2 


8685 


0 960 


1.819 


408.0 


11/24 


p(1311) 


(-)YGVNIFR 


7/12 


868 5 


0951 


1.602 


333.7 


11/24 


p(1527) 


(-)YGVNFIR 


8/8 


8685 


0.942 


1.783 


356,9 


11/28 


p(2153) 


(-)YGGGVIFR 


9/13 


868 5 


0,937 


1.775 


331.0 


11/24 


p(394) 


(-)YGGIFRR 


10/8 


868 5 


0.935 


1.771 


356.9 


11/28 


p(2147) 


(-)YGGVGIFR 



EXAMPLE 3. 

The amino acid sequence of Peptide Z. a known standard peptide, was determined using the method of the in- 
vention, as applied to Peptide X in Example 1 and Peptide Y in Example 2. Peptide Z has the following amino acid 
sequence: RPPGFSPFR. The molecular mass of Peptide Z was determined to be 1060.5660 to an experimental ac- 
curacy of 30 ppm from the mass spectrum shown in Fig 9 The masses at 1181 .6477, 1296.6933 and 1570 6774 are 
from internal standards, added to allow instrument calibration. 

The set of amino acids that are possibly part of Peptide Z were then defined for consideration in the analysis. The 
defined set of amino acids with the molecular mass of each amino acid less the mass of the one water molecule lost 
during peptide bond formation is the same as those used in Examples 1 and 2. 

Asihe mass of Peptide Z was measured as 1060.5660 to an experimental accuracy of30 ppm, each allowed amino 
acid combination must therefore sum to a mass equal to 1060.5660 ± 30 ppm. In addition, from the immonium Ion 
region of the PSD trace from Fig. 9, shown in Fig. 10, it was determined that Peptide Z must also contain the following 
amino acids: phenylalanine with a characteristic mass at 120.20, arginlne with a characteristic mass at 174.94, serine 
together with proline as deduced from the mass at 1 67.23, and glycine together with proline as deduced from the mass 
at 155.66. 

Application of the equation in Example 1 was used to determine the allowed combinations of amino acids for 
Peptide Z, and demonstrates that only the following combinations of amino acids are allowed for Peptide Y: 



45 



SO 



55 



PTIW + FRPSG 


GAAAAR + FRPSG 


GVVNK + FRPSG 


AWID + FRPSG 


WIW+ FRPSF 


GPPVF + FRPSG 


ASPNK + FRPSG 


SPWD + FRPSG 


GQRR + FRPSG 


GPVIM + FRPSG 


GGAAIK + FRPSG 


GVINN + FRPSG 


ANRR + FRPSG 


APWM + FRPSG 


GGGPTK + FRPSG 


AWNN + FRPSG 


GOARR+FRPSG 


AAIIE + FRPSG 


GGGWK + FRPSG 


GGGVIN + FRPSG 


PPFR + FRPSG 


GPTIE + FRPSG 


GAAAVK + FRPSG 


GAAAIN + FRPSG 


PIMR +- FRPSG 


GWIE + FRPSG 


GGASPK + FRPSG 


GGAWN + FRPSG 


VIER + FRPSG 


ASPIE + FRPSG 


IQQQ + FRPSG 


AAAAVN + FRPSG 


VNQR + FRPSG 


APVTE + FRPSG 


GAIQQ + FRPSG 


SSPII + FRPSG 


CGVQR + FRPSG 


AWVE + FRPSG 


AAVQQ + FRPSG 


SPVTI + FRPSG 


AAAQR + FRPSG 


GSPKK + FRPSG 


AAINQ + FRPSG 


GGGGGVI + FRPSG 


IIDR + FRPSG 


IQQK + FRPSG 


GVVNQ+ FRPSG 


GGGAAAAI + FRPSG 


INM + FRPSG 


GAIQK + FRPSG 


GGAAIQ + FRPSG 


PPTTT + FRPSG 


GGINR + FRPSG 


AAVQK + FRPSG 


GGG WQ + FRPSG 


PWTT + FRPSG 


GAVNR + FRPSG 


GSPQK + FRPSG 


AAAVQ + FRPSG 


GGGGAW + FRPSG 


GGGGIR + FRPSG 


AAINY + FRPSG 


GVIID + FRPSG 


GGAAAAV + FRPSG 
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(continued) 



GGGAVR + FRPSG 


GPTNK + F RPSG 


APTID + FRPSG 


AAAAAAA + FRPSG 



5 These combinations constitute the allowed set of amino acid combinations for Peptide Z. 

In addition, Peptide Z was obtained by a tryptic cleavage, and, from the accepted specificity oftrypsin, Peptide Z 
must have lysine or arginine as its carboxy terminal amino acid. With this constraint, the allowed library of linear peptides 
for Peptide Z ie constructed from all individual linear pennutations of the combinatbns above. The allowed library 
includes over 2,000,000 peptides, and is thus not shown. 

10 As with Examples 1 and 2, the method of U.S. Patent No. 5,538,897 was then used to match Peptide Z to this 

library by tandem mass spectrometry. The experimental tandem mass spectrum of Peptide Z is shown In Fig. 11 , and 
the top 10 ranking peptides matched to this spectrum provided below Of these ten, the top ranking peptide, RPPGF- 
SPFR is known to be Peptide Z. 



Rank/Sp 


(M + EI) 


Cn 


G A4 


Sp 


Ions 


Reference 


Peptide 


1/1 


1061.2 


1 000 


3.310 


1163.5 


19/24 


p(135) 


(-)RPPGFSPFR 


2/2 


1061.2 


0 871 


2.884 


1126.6 


19/24 


P(120) 


(-)RPPGFPSFR 


3/5 


1061.2 


0 857 


2.835 


8247 


17/24 


p{122) 


(-)RPPFGPSFR 


4/11 


1061.2 


0 849 


2.811 


692.8 


16/24 


p(164) 


(-)RPPGFFPSR 


5/4 


1061.2 


0 833 


2.759 


831 2 


17/24 


p(189) 


(-)RPPGFFSPR 


6/3 


1061.2 


0 831 


2.749 


8729 


17/24 


p{131) 


{-)RPPSFGPFR 


7/6 


1061.2 


0819 


2.711 


797 1 


17/24 


p(126) 


(-)RPFGPPSFR 


8/12 


1061.2 


0 806 


2.668 


6740 


16/24 


p(100) 


(-)RPPGPSFFR 


9/13 


1061.2 


0.792 


2.623 


6684 


16/24 


p(137) 


(-)RFPPGSPFR 


10/14 


1061.2 


0.782 


2.588 


6565 


16/24 


p(138) 


(-)RFGPPSPFR 



While it is apparent that the invention disclosed herein is well calculated to fulfill the objectives stated above, it will 
be appreciated that numerous modifications and embodiments may be devised by those skilled in the art. Therefore, 
it is intended that the appended claims cover all such modifications and embodiments that fall within the true spirit and 
scope of the present Invention. 
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Annex to the descrlpti n 



SEQUENCE LISTING 

5 

(1) GENERAL INFORMATIONS 



(i) APPLICANT: 

(A) NAME: OXFORD GLYCOSCIKNCBS (UK) LTD 

(B) STREET: 10 The Quadrant, Barton Lane, Abingdon 
10 Science Park 

(C) CITY! Abingdon 

(D) STATE: N/A 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP): 0X14 3YS 



(ii) TITLE OF INVENTION: A METHOD FOR DE NOVO PEPTIDE SEQUENCE 
DETERMINATION 



(iii) NUMBER OF SEQUENCES: 353 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release /l.O, Version #1*30 (EPO) 

(V) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: EP 98304111.2 

25 

(2) INFORMATION FOR SEQ ID NO: 1: 



(i) SEQUENCJB CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Tyr Ilo His Trp Arg 
1 S 

(2) INFORMATION FOR SEQ ID NO: 2: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

.5 (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2; 

lie Tyr His Trp Arg 
1 5 

50 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 3 

Tyr HiB lie Trp Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 unino acldg 

(B) TYPES amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

Hia Tyr lie Trp Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

lie His Tyr Trp Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRAHDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

His lie Tyr Trp Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH; S amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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10 



(xi| SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Tyr He Trp Hia Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHAJ^ACTERISTICS : 

(A) LENGTH; 5 &mino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

He Tyr Trp Hifl Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 9: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

26 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



15 



30 



45 



SO 



Tyr Trp He His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 10; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

Trp Tyr He His Arg 
1 5 



55 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 

He Trp Tyr His Arg 
1 5 
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10 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOi 12 

Trp lie Tyr His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

Tyr Hifl Trp lie Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



25 



35 



40 



SO 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

His Tyr Trp lie Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

Tyr Trp His lie Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 
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10 



25 



30 



40 



45 



(A) LENGTH: 5 amino acide 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLCXiy: linear 

(ii) MOLECULE TrPE: peptide 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 16 

Trp Tyr His lie Arg 

1 5 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acida 
'iS (B) TYPE: aunino acid 

(C) STRANDEDNESS: single 

(D) TOPOIXXSY: linear 

(ii) MOLECULE TYPE: peptide 



20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

Hie Trp Tyr lie Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: & amino acida 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

Trp His Tyr lie Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO; 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acida 

(B) TYPE; amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

lie His Trp Tyr Arg 

1 5 

SO (2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acida 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

55 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

His He Trp Tyr Arg 

1 5 

10 <2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 aittino aclde 

(B) TYPE: amino acid 

(C) STRANDEDNESS; single 
75 (D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 



20 



30 



35 



SO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

He Trp His Tyr Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH! 5 amino acida 
(B) TYPE: amino acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 22 

Trp He His Tyr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

His Trp He Tyr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

Trp His lie Tyr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino Bcids 

(B) TTPE: unlno acid 

(C) STRAKDSDNESSt single 

(D) TOPOLOCyi linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 

Tyr lie His Glu Gly Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 26: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE I peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 



15 



30 



He Tyr Hie Glu Gly Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE i peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 

Tyr His He Glu Gly Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY! linear 

SO (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

His Tyr Glu Gly Arg 
1 5 

55 



20 
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10 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 6 anino acids 
{B) TVPE: amino acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(11) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 

lie His Tyr Glu Gly Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 

25 His He Tyr Glu Gly Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 31: 



15 



20 



30 



35 



SO 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 31 



Tyr He Glu His Gly Arg 
1 5 

40 (2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii| MOLECULE TYPE: peptide 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 

He Tyr Glu His Gly Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 6 amino acids 

(B) TYPEt Amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

10 Tyr Clu lie His Gly Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
j5 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(C> STRANDEDNESS: Single 
(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPEt peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

Glu Tyr He Hta Gly Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO; 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



3S 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 

He Glu Tyr His Gly Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 

Glu He Tyr His Gly Arg 
1 5 

50 

(2) INFORMATION FOR SEQ ID NO: 37 i 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

55 (C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 

Tyr Hia Glu He Gly Arg 
1 5 

10 (2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



20 



30 



35 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 

Hia Tyr Glu He Gly Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 
25 <A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 

Tyr Glu Hie He Gly Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: eunino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 

Glu Tyr His He Gly Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 41: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: peptide 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 

His Giu Tyr lie Gly Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 

Clu His Tyr lie Gly Arg 

1 5 

<2) INFORMATION FOR SEQ ID NO: 43: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: & amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 



15 



30 



lie His Glu Tyr Gly Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 2unino acids 

(B) TYPE; amino acid 

^5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 

His lie Glu Tyr Gly Arg 
1 5 



45 



SO 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 

He Glu His Tyr Gly Arg 
1 5 



24 



EP0 887 646 Al 



10 



(2) INFORMATION FOR SEQ ID NO: 46: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 unino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 

Glu Ilo His Tyr Cly Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 

25 His Glu lie Tyr Gly Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 48: 



75 



20 



30 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48 

Glu His lie Tyr Gly Arg 

1 5 

40 [2) INFORMATION FOR SEQ ID HO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS; single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



SO 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49 

Tyr He His Gly Glu Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 6 amino acids 

(B) TYPEi amino acid 

(C) STRANDEDNESS: Bingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPEs peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 

lie Tyr His Gly Glu Arg 

10 15 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Bingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 51 

Tyr HiB lie Gly Glu Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 52: 

25 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 

Hie Tyr lie Gly Glu Arg 

35 15 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



^5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 

He His Tyr Gly Glu Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 54: 

SO 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 



55 



26 
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(D) TOPOLOGY: linear 
(11) MOLECULE TyPE: p«ptldo 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 54 

Hi8 He Tyr Gly Glu Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 55: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRAMDEONESS: Single 

(D) TOPOLOGY: Linear 

(11) MOLECULE TYPE: p«ptldo 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 

20 Tyr He Gly Hie Glu Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 56: 

(1) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



IS 



30 



4S 



(11) MOLECULE TYPE; peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 



He Tyr Gly His Glu Arg 
1 5 

35 (2) INFORMATION FOR SEQ ID NO: 57; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 

Tyr Gly He His Glu Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 58: 

(1) SEQUENCE CHARACTERISTICS: 
SO (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(11) MOLECULE TYPE: peptide 



55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 

Gly Tyr He His Glu Arg 
1 5 

5 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TXPEi amino acid 

70 (C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 59 

He Gly Tyr Hie Glu Arg 
1 S 

(2) INFORMATION FOR SEQ ID NO: 60: 

2^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 

Gly He Tyr His Glu Arg 
1 5 



30 



35 



45 



SO 



55 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 6 amino acids 

(B) TYPE! amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 

Tyr His Gly He Glu Arg 
1 S 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPEi peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 

His Tyr Gly He Glu Arg 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 anino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 

Tyr Gly His lie Glu Arg 
1 5 

75 (2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 



Gly Tyr His lie Glu Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 

His Gly Tyr He Glu Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 66: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 

Gly His Tyr He Glu Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 67: 
55 (1) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 6 amino acids 

(B) TYPEi amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 



lie His Gly Tyr Glu Arg 
1 5 

|2) INFORMATION FOR SKQ ID NO: 68s 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 



His lie Gly Tyr Glu Arg 
1 5 

25 (2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 69 

lie Gly His Tyr Glu Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 

Gly He His Tyr Glu Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 71: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

55 (C) STRANDEDNESS: single 



30 



EP0 887 646 A1 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 71 

His Gly lie Tyr Glu Arg 
1 5 

10 (2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



15 



20 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 



Gly His He Tyr Glu Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 

Tyr He Glu Gly His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 

He Tyr Glu Gly His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: peptide 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 

Tyr Glu lie Gly Hio Arg 
1 5 

5 

(2) INFORMATION FOR SEg ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76 

Glu Tyr lie Gly Hia Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 77 

lie Glu Tyr Gly Hia Arg 

30 1 5 

(2) INFORMATION FOR SEQ ID NO; 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: peptide 

''^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 

Glu lie Tyr Gly His Arg 
1 5 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 79: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 

Tyr He Gly Glu His Arg 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 80: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TVPE; amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80 

lie Tyr Gly Glu His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81 



25 Tyr Gly lie Glu His Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 82: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 



Gly Tyr lie Glu His Arg 
1 5 

40 (2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 

lie Gly Tyr Glu His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 



33 



EP 0 887 646 A1 



70 



75 



(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 

Giy He Tyr Glu His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 85: 

(1] SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: aingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85 

Tyr Glu Gly He His Arg 
1 5 

25 (2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 

Glu Tyr Gly He His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 

Tyr Gly Glu He His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 

(xi) SEQUENCE DESCRIPTION: SEg ID NO: 88 

Gly Tyr Glu He Hia Arg 
1 5 

10 (2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



15 



20 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 



Glu Gly Tyr He His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 



Gly Glu Tyr lie His Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS; single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 

He Glu Gly Tyr Hie Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 92: 

SO (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

£5 (ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 

Glu lie Gly Tyr Hia Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
|B) TYPE: amino acid 
|C) STRANDEDNESS: single 
(D) TOPOLOGY t linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 

He Giy Glu Tyr His Arg 
1 5 

(2) INFORMATION FOR SKQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
[B} TYPE: amino acid 

[C) STRANDEDNESS: single 

[D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 



IS 
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35 



40 



45 



50 



55 



Gly He Glu Tyr Hia Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 95: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95 

Glu Gly He Tyr His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96 

Gly Glu He Tyr His Arg 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: flingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

Tyr HiB Glu Gly He Arg 

1 5 

15 (2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

His Tyr Glu Gly He Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: aunino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

Tyr Glu His Gly He Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

Glu Tyr His Gly lie Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 101: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 



10 His Glu Tyr Gly He Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 102: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

( B ) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 



Glu His Tyr Gly He Arg 
1 5 

25 (2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 

Tyr His Gly Glu He Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: G amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY! linear 



55 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 

His Tyr Gly Glu He Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH I 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: lin ar 
(ii) MOLECULE TYPE: peptide 

5 

(>fi) SEQUENCE DESCRIPTION: SEQ ID NO: 105 

Tyr Gly His Glu lie Arg 

1 5 

10 (2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 6 anino acids 
<B) TYPE: amino acid 
(C) STRANDEDNESSs single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106 

Gly Tyr His Glu lie Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107 

His Gly Tyr Glu He Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 108: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108 

Gly His Tyr Glu He Arg 
1 5 



55 



(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

Tyr Glu Gly Hio lie Arg 
1 5 

5 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 6 amino acids 
(B) TYPE: amino acid 
TO (C) STRANDEDNESS : Single 

(D) TOPOLOGY: linoar 

(ii) MOLECULE TYPE: peptide 



15 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

Glu Tyr Gly His lie Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTHt 6 amino acids 
(0) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOj 111: 

Tyr Gly Glu His lie Arg 

30 1 5 

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPEz amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 112: 

Gly Tyr Glu His He Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

Glu Gly Tyr Hie He Arg 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114 

Gly Glu Tyr His He Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115 

25 His Glu Gly Tyr He Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 116: 



IS 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: eingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116 



Glu His Gly Tyr He Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117 

His Gly Glu Tyr He Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 118: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOIOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 118 

Gly His Glu Tyr lie Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 119: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119 

Glu Gly His Tyr lie Arg 
1 5 

25 (2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



30 
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SO 



55 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120 

Gly Glu His Tyr He Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 121 

He His Glu Gly Tyr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 2 

Hi« He Glu Gly Tyr Arg 
1 5 

JO (2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123 

20 He Glu Hia Gly Tyr Arg 

1 5 

<2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124 

Glu lie Hia Gly Tyr Arg 
1 5 

3S (2) INFORMATION FOR SEQ ID NO: 12 5: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(i> 

40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125 

His Glu lie Gly Tyr Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 12 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126 

GLu His lie Gly Tyr Arg 
1 5 

5 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

70 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

75 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127 

He His Gly Glu Tyr Arg 
1 5 



20 



(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acida 

(B) TYPE: anino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128 

His He Gly Glu Tyr Arg 

30 15 

(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
35 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



40 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129 

He Gly His Glu Tyr Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130 



Gly He Hia Glu Tyr Arg 

55 15 
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(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOCy; linear 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131 

His Gly lie Glu Tyr Arg 

1 5 

75 (2) INFORMATION FOR SEQ ID NO: 132 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132 

Gly Hia He Glu Tyr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



40 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133 

Ho Glu Gly His Tyr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134 

Glu He Gly His Tyr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 135: 

55 (i) SEQUENCE CHARACTERISTICS: 



45 



SO 



45 



EP 0 887 646 A1 



IS 



20 



35 



40 



46 



SO 



(A) LENGTH: 6 amino acids 

(B) TYPES amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xt) SEQUENCE DESCRIPTION: SZQ ID NO: 135 



10 lie Gly Glu Hie Tyr Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 136: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

{C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136 



Gly He Glu His Tyr Arg 
1 5 

25 (2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137 

Glu Giy He Hia Tyr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 138 

Gly Glu He Hia Tyr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139 

Hie Glu Gly He Tyr Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS: B ingle 

(D) TOPOLOGY 1 linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 140 

Glu His Gly lie Tyr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS! 
25 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



75 



20 



30 



35 



45 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141 

His Gly Glu He Tyr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 142: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142 

Gly Hio Glu He Tyr Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 143: 

SO (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 143 

Glu Gly Hia lie Tyr Arg 

1 5 

(2) IWFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 
(X) LENGTH; 6 amino acida 

(B) TYPEx amino acid 

(C) 5TRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144 

Gly Glu His lie Tyr Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 145: 

20 (i) SEQUENCE CHARACTERISTICS j 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 
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[ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145 

Tyr lie His Asp Ala Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 146: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

35 (C> STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146 

lie Tyr His Aap Ala Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

SO (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147 

Tyr His lie Asp Ala Arg 
1 5 

55 
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(2) INFORMATION FOR SEQ ID NO: 148: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
<B) TYPE: amino acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION j SEQ ID NO: 148 

His Tyr lie Asp Ala Arg 

1 5 

75 (2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149 



He His Tyr Asp Ala Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS! 
30 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150 

His He Tyr Asp Ala Arg 
1 5 

''^ (2) INFORMATION FOR SEQ ID NO: 151: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151 

Tyr He Asp His Ala Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 152: 
55 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 6 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY! linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152 

He Tyr Asp His Ala Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: aingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153 

Tyr Asp lie His Ala Arg 

1 5 

25 (2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154 

Asp Tyr He His Ala Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE; peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155 

He Asp Tyr Hia Ala Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

[xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156 

Asp He Tyr His Ala Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157 

20 Tyr His Asp He Ala Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 158: 



15 



25 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

( B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158 



His Tyr Asp He Ala Arg 
1 5 

35 (2) INFORMATION FOR SEQ ID NO: 159; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 ^ ' 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159 

45 Tyr Asp His He Ala Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 
so (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160 
Asp Tyr His He Ala Arg 

s 

(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS: single 

(D) TOPOLOCy: linear 

(ii) MOLECULE TYPE: peptide 

j5 (xi) SEQUENCE DESCRIPTION: SEQ ID MO: 161 

His Asp Tyr lie Ala Arg 

1 5 



20 



(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 6 amino acide 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (li) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162 

Aap His Tyr He Ala Arg 

30 15 

(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163 

He His Asp Tyr Ala Arg 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY t linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 

His He Asp Tyr Ala Arg 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRTPTTON: SEQ TD NO: 165 

lie Asp His Tyr Ala Arg 

1 5 

(2) INFORKATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166 

Aep lie His Tyr Ala Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167 

His Asp lie Tyr Ala Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 16B 

Aep Hio lie Tyr Ala Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 6 amino acids 

(B) TYPE: «u«ino acid 

(C) STRAKDEDNESS: oingle 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169 

10 Tyr lie His Ala Asp Arg 

1 5 

(2) INFORMATION FOR SBQ ID NO: 170: 

(i> SEQUENCE CHARACTERISTICS: 
,5 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS! single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170 

lie Tyr Hie Ala Aap Arg 
1 5 

2S (2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
<Bi TYPE: amino acid 
(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171 

Tyr His lie Ala Asp Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



SO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172 

His Tyr lie Ala Asp Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

55 (C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE; peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173 

He HiB Tyr Ala Asp Arg 

1 5 

(2) INFORMATION FOR SEg ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) 5TRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPEi peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174 

20 His He Tyr Ala Asp Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 175: 

(1) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



15 



30 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175 

Tyr He Ala His Asp Arg 
1 5 

35 ^2) INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



45 



SO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176 

He Tyr Ala His Asp Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 177: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 177 

Tyr Ala lie His Asp Arg 
1 5 

5 

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESSi single 

(D) TOPOLOGY t linaar 

(ii) MOLECULE TYPE: peptide 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178 

Ala Tyr lie His Asp Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO; 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179 

lie Ala Tyr His Asp Arg 

30 15 

(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180 

Ala He Tyr His Asp Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181 

Tyr His Ala He Asp Arg 

1 5 
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(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENCTHt 6 amino acide 

(B) TYPE; amino acid 

(C) STRANDEDNESS: Bingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1B2 

His Tyr Ala lie Asp Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183 



25 Tyr Ala His lie Asp Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 184: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184 

Ala Tyr His lie Aop Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 185 t 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185 

His Ala Tyr lie Asp Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 6 amino acids 

( B ) TYPE : amino acid 

(C) STRANDEDNESS: Bingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186 

10 Ala His Tyr lie Asp Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 
/5 (A) X/ENGTH: 6 2unino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



35 



40 



45 



SO 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187 



lie His Ala Tyr Asp Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: IBS: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 188 

His lie Ala Tyr Asp Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 189; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189 

lie Ala His Tyr Asp Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 190: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

55 (C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 

(xi) SEQaENCE DESCRIPTION: SEQ ID NO: 190 

Ala lie Hl8 Tyr Aap Arg 
1 5 

10 (2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS ! eingl© 
75 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



20 



2S 



30 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191 

Hie Ala Ilo Tyr Aop Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192 

Ala His lie Tyr Asp Arg 
1 S 

35 

(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

40 <C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193 

Tyr lie Asp Ala His Arg 

1 5 



55 



(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY I linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194 

lie Tyr Asp Ala His Arg 
1 5 

5 

(2) INFORMATION TOR SKQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS: single 

(D) TOPOLOGY I linoar 

(ii) MOLECULE TYPE: peptide 



15 
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(xi) SEQUENCE DESCRIPTION: SEQ ID KO: 195 

Tyr Asp lie Ala His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acide 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196 

Asp Tyr lie Ala His Arg 

30 1 5 

(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 ainino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197 

lie Asp Tyr Ala His Arg 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198 

Asp He Tyr Ala His Arg 

1 5 
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(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTHi 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199 

Tyr lie Ala Aap His Arg 
1 5 

^5 (2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200 



25 He Tyr Ala Asp Hia Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201 

Tyr Ala He Asp His Arg 

1 5 

40 (2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: peptide 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202 

Ala Tyr He Asp His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 6 amino acids 

(B) TYPEt amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203 



10 He Ala Tyr Asp His Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 204: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPEt peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204 

Ala He Tyr Aap His Arg 

1 5 

25 (2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECUI>E TYPE: peptide 



(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 205 

Tyr Asp Ala He His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206 

Asp Tyr Ala He His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 207: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

55 (C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207 

Tyr Ala Asp lie His Arg 
1 5 

10 (2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
^5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208 

Ala Tyr Asp lie His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NOi 209: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 6 aroino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 
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(ii) MOLECULE TYPE; peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209 

Asp Ala Tyr lie His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 210: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 210 

Ala Asp Tyr lie His Arg 

1 5 



(2) INFORMATION FOR SEQ 3D NO: 211: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211 

lie Aep Ala Tyr His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY 5 linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212 

Asp He Ala Tyr His Arg 
1 5 

<2) INFORMATION FOR SEQ ID NO: 213 i 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213 

He Ala Asp Tyr His Arg 

30 15 

(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214 

Ala He Asp Tyr His Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215 



Asp Ala He Tyr His Arg 

55 15 
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(2) INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acida 

(B) TYPE: arolno acid 

(C) STRANDEDNESS: single 

(D) TOPOUX;y: linaar 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216 

Ala Asp lie Tyr His Arg 

1 5 

j5 (2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESSi single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217 

^5 Tyr His Asp Ala He Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 6 amino acids 

(B) TVPEt aunino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218 

His Tyr Asp Ala He Arg 
1 5 

(2) INFORMATION FOR SEQ ID NOt 219: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
^5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219 

Tyr Asp His Ala He Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 220: 
55 (i| SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 6 amino acide 

(B) TYPEt amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptid 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220 

10 Asp Tyr HiB Ala He Arg 

1 5 

(2) INFOHMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221 

His Aap Tyr Ala He Arg 

1 5 

25 (2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 ainino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222 

Asp His Tyr Ala lie Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



SO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223 

Tyr His Ala Asp He Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 224: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 2unino acids 

(B) TYPE: amino acid 

55 (C) STRANDEDNESS: single 
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(D) TOPOLOCy: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224 

His Tyr Ala Asp lie Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 225: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino aclde 

(B) TYPE; amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225 

Tyr Ala His Asp lie Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226 

Ala Tyr Hie Asp lie Arg 
1 5 

55 ^2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: peptide 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227 

His Ala Tyr Asp lie Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22B; 

Ala Hia Tyr Asp lie Arg 
1 5 

5 

(2) INFORMATION FOR SEQ ID NOx 229: 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS: single 

( D ) TOPOLrOOY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229 

Tyr Asp Ala His II© Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 230: 

<i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230 

Asp Tyr Ala His He Arg 

30 15 

(2) INFORMATION FOR SKQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acide 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231 

Tyr Ala Asp His He Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232 

Ala Tyr Asp His He Arg 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233 

Asp Ala Tyr His lie Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234 



25 Ala Asp Tyr His lie Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 235: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY! linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235 

His Asp Ala Tyr lie Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPEt amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236 

50 Asp His Ala Tyr He Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 237: 
(i) SEQUENCE CHARACTERISTICS: 

55 



69 



EP 0 887 646 A1 



(A) LENGTH: 6 amino acids 

( B ) TYPE : amino acid 

(C) STRAKDEDNESS: single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: peptid* 

(XL) SEQUENCE DESCRIPTION: SEQ ID NO: 237 

^0 His Ala Asp Tyr He Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 6 amino acids 

(B) TYPE: ainino acid 

(C) STRAKDEDNESS: single 

(D) TOPOLOGY: linear 



20 



25 



30 



35 



SO 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 38 

Ala His Asp Tyr He Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRAKDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239 

Asp Ala His Tyr He Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 240: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240 



55 



Ala Asp His Tyr He Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLCX;Y: linear 
MOLECULE TVPEi peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241 

lie His Asp Ala Tyr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHi 6 ajnino acido 

(B) TYPE: amino acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242 

HiB He Asp Ala Tyr hrg 

I 5 

(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH; 6 amino acids 

{B) TYPE: amino acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243 

He Asp His Ala Tyr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 244: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEONESS: single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244 

Asp He His Ala Tyr Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 245 i 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 

His Asp lie Ala Tyr Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 
(X) LENGTH t 6 amino acids 

(B) TYPEt amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY! linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 

Asp His Ilo Ala Tyr Arg 

1 5 

(2) INFORMATION FOR SEQ ID NOi 247: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 
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lie His Ala Asp Tyr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D| TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 

His lie Ala Asp Tyr Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY* linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

lie Ala His Asp Tyr Arg 

1 5 
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70 



(2) INFORMATION TOR SEQ ID NO; 2 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: unino acid 

(C) STRANOEONESS: Single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: paptido 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250 

Ala lie His Asp Tyr Arg 
1 5 

(2} INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNSSS: singlo 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251 

25 His Ala lie Asp Tyr Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 2 52: 



15 



20 



30 



45 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252 

Ala Hia lie Asp Tyr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 2 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS! single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253 

lie Asp Ala His Tyr Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 254: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 6 amino acide 

( B) TYPS: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLCXSY; linear 

(ii) MOLECULE TYPE: peptid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254 



)0 Asp lie Ala His Tyr Arg 

1 S 

(2) INFORMATION FOR SEQ ID NO: 255: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255 



lie Ala A8p His Tyr Arg 
1 5 

25 (2) INFORMATION FOR SEQ ID NO: 25 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256 

Ala lie ABp His Tyr Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257 

Aep Ala He Hia Tyr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 2 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258 

Ala Asp lie Hi* Tyr hrg 

1 5 

W (2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acida 

(B) TYPE: amino acid 

(C) STRANDEDNESS: eingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259 

20 His Asp Ala lie Tyr Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 
2^ (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



40 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260 



His Ala Asp lie Tyr Arg 
1 5 

35 (2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261 

His Ala Asp lie Tyr Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 
SO (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

Ala Hie Aep lie Tyr Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

75 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

Asp Ala His lie Tyr Arg 
1 5 



TO 



20 



25 



(2) INFORMATION FOR SEQ ID NO: 264: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPEx peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 

Ala Aep Hie lie Tyr Arg 

30 15 

(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 7 amino acids 
35 (B) TYPE: amino acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 265: 

Tyr Cly Gly Phe lie Arg Arg 
1 5 

{2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 
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Tyr Gly Gly Phe Phe lie Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUEHCE DESCRIPTIONt SEQ ID NO* 267 

IS Tyr Gly Gly Arg lie Phe Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



25 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268 

Tyr Gly Gly Phe He Gly Val Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acide 

(B) TYPE! amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269 

Tyr Gly Gly Phe He Val Gly Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 270: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

SO (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270 

Tyr Gly Gly Arg Phe He Arg 

55 1 5 
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(2) INFORMATION FOR SEQ ID NO: 2 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acidB 

(B) xrPX: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY t linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION! SEQ ID NO: 271 

Tyr Gly Val Asn He Phe Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acide 

(B) xyPE* amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE; peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 272 

Tyr Gly Val Asn Phe He Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 8 amino acids 
30 (B) TYPE: amino acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273 

Tyr Gly Gly Gly Val He Phe Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 274: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274 

Tyr Gly Gly He Phe Arg Arg 

so 15 

{2} INFORMATION FOR SEQ ID NO: 27 5: 
(i) SEQUENCE CHARACTERISTICS J 

£5 



45 
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(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275 

10 Tyr Gly Gly Val Gly lie Phe Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTHS 9 anino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276 

Arg Pro Pro Gly Phe Ser Pro Phe Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 27 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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[xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277 

Pro Thr lie Trp 
1 

(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278 

Phe Arg Pro Ser Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 279: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
£5 (D) TOPOLOGY: linear 
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(il) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279 

5 

Gly Ala Ala Ala Ala Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 280: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acida 

(B) Typ£: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280 



20 
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Gly Val Val Aen Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281 

Ala Val Val lie Asp 
1 S 

(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 
(D) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282 

45 Gly Pro Pro Val Phe 

1 5 

(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS; 
£0 (A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE; peptide 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283 

Ala Ser Pro Asn Lyo 

1 5 

5 

(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284 
Ser Pro Val Val Asp 

a 5 

(2) INFORMATION FOR SEQ ID NO: 285: 

20 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESSi single 

(D) TOPOLOGY: linear 

2s (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285 



30 



Gly Gin Arg Arg 

1 

(2) INFORMATION FOR SEQ ID NO: 2 86: 



(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 5 amino acids 
(B) TYPE: aninc acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286 

Gly Pro val II© Met 

1 5 

(2) INFORMATION FOR SEQ ID NO: 287: 

^5 (j^j SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

SO (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287 

Gly Gly Ala Ala lie Lys 
1 5 

55 
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(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: ajnino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 288; 

Gly Val lie Asn Abh 
1 5 

(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289; 

Ala Asn Arg Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 5 amino acids 

(3) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

Ala Pro Val Val Met 

1 5 

''^ (2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 6 amino acide 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 

Gly Gly Gly Pro Thr Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 292: 
55 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



TO 



15 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292 



Ala Val Val Asn Asn 

1 5 



(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: aingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293 

Gly Xaa Ala Arg Arg 
1 5 

25 (2) INFORMATION FOR SEQ ID NO: 294: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294 

Ala Ala lie lie Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295 



Gly Gly Gly Val Val Lys 
1 5 

50 

(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

55 (C) STRANDEDNESS: single 
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(D) TOPOLCXjy: linear 
(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SBQ ID NO: 296 

Gly Gly Gly Val lie Asn 
1 5 

(2) INFORMATION FOR SEQ ID NO: 297: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297 

Pro Pro Phe Arg 
1 

(2) INFORMATION FOR SEQ ID NO; 298: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPRi amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



IS 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298 

Gly Pro Thr lie Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299 



45 Gly Ala Ala Ala Val Lys 

1 5 

(2) INFORMATION FOR SEQ ID NO: 300: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



55 



84 



EP 0 887 646 A1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300 

Gly Ala Ala Ala lie Asn 
1 5 

5 

(2} INFORMATION FOR SEQ ID NO: 301: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS: single 

(D) TOPOLOGY* linear 

(ii) MOLECULE TYPE: peptide 

^5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301 

Pro lie Met Arg 
1 



20 



(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302 

Gly Val Val He Glu 

30 1 5 

(2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303 

Gly Gly Ala Ser Pro Lys 
1 5 



35 



45 



SO 



55 



(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION! SEQ ID NO: 304 

Gly Gly Ala Trp Asn 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 aniino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 30S 

Val He Glu Arg 
1 

(2) INFORMATION FOR SEQ ID NOi 306: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306 

25 Ala Ser Pro He Glu 

1 5 

(2) INFORMATION FOR SEQ ID HO: 307: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGlf: linear 



20 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307 

IlG Gin Gin Gin 

1 

(2) INFORMATION FOR SEQ ID NO: 308: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308 

Ala Ala Ala Ala Val Asn 

1 5 

(2) INFORMATION FOR SEQ ID NO: 309: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: p ptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309 

Val Asn Gin Arg 

1 

(2) INFORMATION FOR SEQ ID NOj 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310 

Ala Pro Val Thr Glu 

1 5 

25 {2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



20 



3S 



46 



SO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311 

Gly Ala lie Gin Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312 

Ser Ser Pro lie lie 
1 5 

(2) INFORMATION FOR SEQ ID NO: 313: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

£5 (C) STRANDEDNESS: single 
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(D) TOPOLOCr: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313 

eye Gly Val Gin Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 314: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
<B) TYPE: amino acid 
(C) STRANDEDNESS: single 
15 (D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314 

Ala Val Val Val Glu 

1 5 

(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHi 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315 

Ala Ala Val Gin Gin 

1 5 

(2) INFORMATION FOR SEQ ID NO: 316: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316 

Ser Pro Val Tlir Tie 
1 5 

(2) INFORMATION FOR SEQ ID NO; 317; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: S amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317 

Ala Ala Ala Gin Arg 
1 5 

5 

(2) INFORMATION FOR SEQ ID NO: 318: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acidB 

(B) TYPE: amino acid 

(C) STRANDEONSSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318 

Gly Ser Pro Lya Lys 

1 5 

(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRAMDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319 

Ala Ala lie Asn Gin 

30 1 5 

(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



20 



35 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320 

Gly Gly Gly Gly Gly Val He 
1 5 

(2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321 



He He Asp Arg 

55 1 
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(2) INFORMATION FOR SEQ ID NO: 322: 

(1) SEQUBNCI CHARACTERISTICS: 

(A) LENCTHi 4 unino acids 

(B) TYPE: amino acid 

(C) STRANDEONESS: single 

(D) TOFOLOGY; linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322 

lie Gin Gin Lye 

1 

(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: S axnino acids 
<B) TYPE: amino acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323 

25 Gly Val Val Asn Gin 

1 5 

(2) INFORMATION FOR SEQ ID NO: 324: 



75 



30 



35 



(i) SEQUENCE CHARACTBRISTICS : 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324 

Gly Gly Gly Ala Ala Ala Ala He 

1 5 

^0 (2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



SO 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325 

Gly Ala lie Gin Lys 

1 5 

(2) INFORMATION FOR SEQ ID NO: 326: 
(i) SEQUENCE CHARACTERISTICS; 
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15 



(A) LENGTH: 5 amino acids 

(B) TYPEt amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE description: SEQ ID NO: 32 6 

Gly Gly Ala Ala lie Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO: 327: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327 

Pro Pro Thr Thr Thr 

1 5 

25 (2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE! anino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY! linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328 

Gly Gly lie Asn Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329 

Ala Ala Val Gin Lye 
1 5 

(2) INFORMATION FOR SEQ ID NO: 330: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 6 amino acids 

(B) TYPE: amino acid 

55 (C) STRANDEDNESS: single 
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{D) TOPOLOGY: linear 
{ii> MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330 

Gly Gly Gly Val Val Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESSx Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331 

20 Pro Val Val Thr Thr 

1 5 

(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



15 



30 



<ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332 

Gly Ala Val Asn Arg 
1 & 

35 (2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



40 



45 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333 

Gly Ser Pro Gin Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 334: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

Ala Ala Ala Val Gin 

1 5 

5 

{2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCB CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS: single 

<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

,5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 

Gly Gly Giy Gly Ala Val Val 

1 5 

(2) INFORMATION FOR SEQ ID NO; 336: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
<B) TYPE: amino acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 

Gly Gly Gly Gly lie Arg 

30 15 

(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
55 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

Ala Ala He Asn Tyr 

1 5 

(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 

Gly Val He He Asp 
1 5 
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(2) INFORKATION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOUICULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339 

Gly Gly Ala Ala Ala Ala Val 

1 5 

(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340 

25 Gly Gly Gly Ala Vai Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: S amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 
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40 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341 

Gly Pro Thr Aen Lyo 
1 5 

(2) INFORMATION FOR SEQ ID NO: 342: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: S amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342 

Ala Pro Thr He Aep 

1 5 



(2) INFORMATION FOR SEQ ID NO: 343: 
55 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 7 amino acids 

(B) TYPE: Amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: peptide 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 343 

to Ala Ala Ala Ala Ala Ala Ala 

1 5 

(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acida 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344 

Arg Pro Pro Gly Phe Pro Ser Phe Arg 
1 5 

25 (2) INFORMATION FOR SEQ ID NO: 345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



35 



40 



SO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345 

Arg Pro Pro Ph© Gly Pro Ssr Phe Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: eunlno acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346 

Arg Pro Pro Gly Phe Phe Pro Ser Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 347: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

£5 (C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(Li) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347 

Arg Pro Pro Gly Phe Phe Ser Pro Arg 
1 5 

(2) INFORMATION FOR SEQ ID NOi 348: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348 

Arg Pro Pro Ser Phe Gly Pro Phe Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



15 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349 

Arg Pro Phe Gly Pro Pro Ser Phe Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350 

Arg Pro Pro Gly Pro Ser Phe Phe Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 351: 

SO (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351; 

Arg Phe Pro Pro Gly Ser Pro Phe Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO; 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acido 

(B) TYPE: ainino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 

Arg Phe Gly Pro Pro Ser Pro Phe Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 353: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 

Phe Arg Pro Ser Phe 
1 5 



35 

Claims 

1. A method for determining the amino acid sequence ofan unknown peptide, which comprises: 

^0 (a) determining a molecular mass and an experimental tragmentation spectrum for the unknown peptide; 

(b) comparing the experimental fragmentation spectrum of the unknown peptide to theoretical fragmentation 
spectra calculated for a peptide library composed of all possible linear sequences of amino acids having a 
total mass that corresponds to the molecular mass of the unknown peptide; and 

(c) identifying a peptide tn the peptide library having a theoretical fragmentation spectrum that matches most 
45 closely the fragmentation spectrum of the unknown peptide. 

2. The method of claim 1 , wherein the molecular mass for the unknown peptide is determined with an accuracy of 
up to about 30 parts per million. 

50 3, The method of claim 1 or claim 2, wherein the total mass of each of the possible linear sequences of amino acids 
is within the range of plus or minus about 30 parts per million of the molecular mass of the unknown peptide 

4. The method of any preceding claim, further comprising calculating an indication of closeness-of-fit between the 
experimental fragmentation spectrum of the unknown peptide and each of the theoretical fragmentation spectra 

55 calculated for the peptide library. 

5. The method of claim 4, further comprising selecting peak values having an intensity greater than a predetermined 
threshold value when calculating the indicatbn of closeness-of-fit. 
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6. The method of any preceding clainn, further comprising nonnalizing the experimental fragmentation spectrum. 

7. The method of any preceding claim, wherein the ammo acids are selected from tryptophan, arginine, histidine, 
glutamic acid, glutamine, aspartic acid, leucine, threonine, proline, alanine, tyrosine, carbamido cysteine, pheny- 

5 lalanine, methionine, lysine, asparagine, isoleucine, cysteine, valine, serine, and glycine. 

8. The method of any of claims 1 to 6, wherein the amino acids comprise non-natural amino acids or chemically 
modified forms of tho naturally occurring amino acids. 

70 9. The method ot any preceding claim, wherein the unknown peptide has a molecular mass greater than about 1 ,400 
Daltons 

10. The method ofany preceding claim, wherein the molecular mass for the unknown peptide is determined using a 
mass spectrometer. 

15 

11. The method of claim 10, wherein the mass spectrometer is a time-of-flight mass spectrometer. 

12. The method of claim 10, wherein the molecular mass and the fragmentatbn spectrum for the unknown peptide 
are determined using a tandem mass spectrometer. 

20 

13. A method according to any preceding claim, which additionally comprises the identification of one or more amino 
acids in the unknown peptide from its experimental fragmentation spectrum, or from its method of preparation, 
and using the one or more identified amino acids to constrain the library of all possible linear sequences. 

25 14. The method of claim 12, wherein the spectrum has an immonium ion regbn, and the immonium region is used to 
identify one or more amino acids contained in the unknown peptide. 

15. The method of claim 13 or claim 14, wherein the identification comprises comparing a known characteristic of 
amino acids with characteristics ofthe experimental fragmentation spectrum. 

30 

16. The method of any of claims 13 to 15, wherein said one or more amino acids is or includes the N-terminal or C- 
terminal amino acid. 

17. A method of generating a library of amino acid sequences, wherein each sequence in the library represents a 
55 peptide having a molecular mass that corresponds to a single, predetermined molecular mass, which comprises 

defining a set of combinations of altowed amino acids having a molecular weight that corresponds to the 
predetermined molecular mass; and 

generating a library of all possible linear sequences of the amino acids in each combination of the set; 



40 



wherein the library is constrained by identification as defined in any of claims 1 3 to 16. 
18. A method according to claim 17, additionally comprising the characteristic ofany of claims 2 to 12. 



45 
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